This is a "half throughput" solution. It's essentially the top half of the other solution. It can be used to understand + debug the full throughput solution (and IMO makes a nicer gif). 